Privacy Measures for Free Text Documents: Bridging the Gap between Theory and Practice

نویسندگان

  • Liqiang Geng
  • Yonghua You
  • Yunli Wang
  • Hongyu Liu
چکیده

Privacy compliance for free text documents is a challenge facing many organizations. Named entity recognition techniques and machine learning methods can be used to detect private information, such as personally identifiable information (PII) and personal health information (PHI) in free text documents. However, these methods cannot measure the level of privacy embodied in the documents. In this paper, we propose a framework to measure the privacy content in free text documents. The measure consists of two factors: the probability that the text can be used to uniquely identify a person and the degree of sensitivity of the private entities associated with the person. We then instantiate the framework in the scenario of detection and protection of PHI in medical records, which is a challenge for many hospitals, clinics, and other medical institutions. We did experiments on a real dataset to show the effectiveness of the proposed measure.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

نظریه پردازی بر فرآیند انتقال دانش نظری به حوزه عمل در پرستاری: رویکرد گراندد تئوری

Introduction & Objective: Knowledge transfer and in fact, the bridging of theory and practice is one of the main concerns of all academic disciplines. Getting prominent professional status is the thing that can be achieved by knowledge-based function, and of which would be called as successful discipline that it be able to transfer its theoretical paradigmatic claims into practice. Accordingly,...

متن کامل

Bridging the Gap Between Research and Policy and Practice; Comment on “CIHR Health System Impact Fellows: Reflections on ‘Driving Change’ Within the Health System”

Far too often, there is a gap between research and policy and practice. Too much research is undertaken with little relevance to real life problems or its reported in ways that are obscure and impenetrable. At the same time, many policies are developed and implemented but are untouched by, or even contrary to evidence. An accompanying paper describes an innovative progr...

متن کامل

Cross border E-Science and Research Partnership: Bridging the Gap Between Science and Media

  E-Science is a tool that helps scientists to store, interpret, analyze and make a network of their data, and it can play a critical role in different aspects of the scientific goals and research. This commentary, under the topic of Cross Border E-Science and Research Partnership: Bridging the Gap between Science and Media,[1] attempts to shed light on E-Science with emphasis on three importa...

متن کامل

Analyzing Vocabulary Intersections of Expert Annotations and Topic Models for Data Practices in Privacy Policies

Privacy policies are commonly used to inform users about the data collection and use practices of websites, mobile apps, and other products and services. However, the average Internet user struggles to understand the contents of these documents and generally does not read them. Natural language and machine learning techniques offer the promise of automatically extracting relevant statements fro...

متن کامل

The effects of clinical supervision program application on nursing

Background. Practice is where skills are developed and where theory must pass testing but there is a gap between classroom education and practice in nursing. The clinical supervision is a demonstration project of advanced practice and bridging the gap between theory and practice. Clinical supervision is a mechanism supporting both professional and personal development. The main purpose of this ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011